The objective of this notebook is to show some graphs that can be made by means of plotly using a dataset of netflix movies. Trying to find what is the type of movie that is most repeated in the dataset.

hola

Netflix, Inc. is an American streaming platform and entertainment company. Located in Los Gatos (California), the company was created in 1997 and a year later it began its activity, offering a DVD rental service through the mail.3 Currently, Netflix participates in the production of audiovisual works, from the creation or acquisition of the product until its worldwide distribution.

Missing values

Replace values

column analysis

Most of the movies were made after 2016, this means that netflix usually buys recent movies and almost doesn't buy movies from before 2010. This seems to be because netflix made its leap to fame around 2010 and from then on. growing more in popularity, which made him buy the movies of recent years.

The oldest movie is from 1942 and the newest from 2021, most of the movies are distributed between 2012 and 2018, the average being 2016.

Most of the movies were added between July and they almost never add movies in February digging into data netflix prefers to upload their movies on holidays and weekends and that is when people have time to watch them and thus give them a great rating .February may be the one in which fewer movies are added because it has fewer days.

Most of the movies last between 90 and 100 min, which is equivalent to around 1 hour 30 minutes. It is rare that there are movies that last more than 200 minutes. This will happen because the movies have found in 1:30 enough time to tell the plot and without letting the viewer get bored. Most people go to the movies for distraction and in a way they want to see concise stories but not so long, although there are cases in which the movies take around 2 hours and they are good, the normal thing is that they do not exceed 1 with 40 minutes.

In the count of the duration by movies we can notice that the maximum number of times that the duration is repeated is 152, that is, 152 movies have the same duration, although on average the movies share the same duration is 13, which It means that there are many movies that have different durations, most movies share the range of 13 to 39 times the same duration.

This tells us that most of the movies have a different duration time among themselves and that most of the studios or producers have an estimate of time which is the one that repeats in the duration of the movies that netflix adds .

There are 121 countries that have added movies to netflix. Most of the movies are from the United States, which in addition to being the country of origin of netflix is also the third country that generates the most movies in the world, of which the majority are made in by hollywood. The second country that appears with 12.3% of the movies is India, the country that produces the most movies in the world. The third country that produced the most movies was United Kingdom.

Most of the cast appear only in one or two movies so I only took the 20 actors who have appeared in more movies than doing a separate investigation I could tell that the vast majority of them are from India. It is because India is the foreign country that contributes the most movies.

Most of the categories refer to foreign categories since 60% of the films that were added were filmed outside the United States

Most directors have only directed one of the movies added while very few directors have directed more than 15 movies.

It seems that the same directors direct the vast majority of movies this could be because netflix likes those directors or it is due to the level of competition in the film industry worldwide since netflix publishes movies from all over the world and the same directors are the ones who direct most movies.

A sign that the same directors are the ones who direct could be seen in the top 10 there is not a single dominant director in addition to the fact that the percentages are not so different between that is, that among the directors with the most films they are all equally divided although they are from different countries India, Mexico, United States or United Kingdom

MULTI-VARIABLE ANALYSIS

comparing different variables we can realize that there is no clear pattern between the dates and the categories.

The category that has added the most movies is TV-MA followed by TV-14 and that most of these were added from 2017 we can see that in the year 2020-2021 neither UR movies nor NR movies have been added the latest movies NR were added in 2016-2017. Taking into account that Netflix has a TV-Ma rating as an adult category, we can see that most of the movies added to the catalog in the last year are in that category, it could be because that is the public that consumes netflix the most in addition to the following categories are content for adolescents and the least they add is content for children.

The evolution of how the movies were added is captured on a map.

Rating added by years

Children's movies have had an increase in the aggregation of movies every year except for 2018 in which it had a decline of 25% which was increased in the following year in addition to creating a new record of movies added until that year. It continued with the same number of movies added until the year 2021 where it added more movies than the movies that had been added in the previous two years together.

In movies for young people are TV-14 and PG-13 TV-14 movies were added since 2011 while PG-13 movies were added from 2015 have added much less PG-13 movies than TV-14 movies. In the year 2018 alone, more TV-14 movies were added than all the PG-14 movies up to that year.Although we can note that since 2018 the TV-14 movies that have been added have decreased

Most of the movies added are focused on TV-MA. The UR and NR categories stopped being added in 2019 while the TV-MA and R categories are more consistent in the fact that they still add those categories.

Conclusions: The most likely thing you will find on netflix is a US movie with an adult rating added between 2018 and 2020, starring Samuel L. Jackson or Adam Sandler. With an average duration of an hour and a half.